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Abstract 

In light of the growing interest in agent-based market models, we bring 
together several earlier works in which we considered the topic of self- 
consistent market modelling. Building upon the binary game structure 
of Challet and Zhang, we discuss generalizations of the strategy reward 
scheme such that the agents seek to maximize their wealth in a more direct 
way. We then examine a disturbing feature whereby such reward schemes, 
while appearing microscopically acceptable, lead to unrealistic market dy- 
namics (e.g. instabilities). Finally, we discuss various mechanisms which 
are responsible for re-stabilizing the market in reality. This discussion 
leads to a 'toolbox' of processes from which, we believe, successful market 
models can be constructed in the future. 



Oxford Center for Computational Finance working paper: OCCF/010702 



1 Introduction 



Agent-based models have attracted significant interest across a broad range of 
disciplines 0. An increasingly popular application of these models has been 
the study of financial markets [|| ||] . The motivations for this focus on financial 
markets are two-fold: the promise that data-rich financial markets are good 
candidates for the empirical study of complex systems, and the inadequacy of 
standard economic models based on the notions of equilibria and rational expec- 
tations. Currently many different agent-based models exist in the Econophysics 
literature, each with its own set of implicit assumptions and interesting proper- 
ties jjj H, Sj. In general these models manage to exhibit some of the statistical 
properties that are reminiscent of those observed in real-world financial markets; 
for example, fat-tailed distributions of returns and long-timescale volatility cor- 
relations. Despite their differences, these models draw on several of the same key 
ideas: feedback, frustration, adaptability and evolution. The underlying goal of 
all this research effort is to generate a microscopic agent-based model which (i) 
reproduces all the stylized facts of a financial market, and (ii) makes sense on 
the microscopic level in terms of financial market microstructure. While each 
of these goals is separately achievable, the combination of (i) and (ii) within a 
single model represents a fascinating challenge, and underpins our motivation 
for writing the present paper. 

Challet and Zhang's Minority Game H offers possibly the simplest paradigm 
for a system containing the key features of feedback, frustration and adaptabil- 
ity. The MG has remarkably rich dynamics, given the simplicity of its under- 
lying binary structure, and has potential applications in a wide range of fields. 
Consequently Challet and Zhang's MG model, together with the original bar 
model of Arthur Q, represent a major milestone in the study of complex sys- 
tems. Given the MG's richness and yet underlying simplicity, the MG has also 
received much attention as a financial market model . The MG comprises an 
odd number of agents N choosing repeatedly between the options of buying (1) 
and selling (0) a quantity of a risky asset. The agents continually strive to make 
the minority decision i.e. buy assets when everyone else is selling and sell when 
everyone else is buying. This strategy seems to make sense on first inspection. 
For example, a majority of agents selling will force the price of the asset down 
and thus ensure a low buy price (which is favorable) for the minority of the 
agents who decided to buy; hence the minority group has 'won'. However, now 
consider the scenario where this pattern repeats over and over. The minority 
of agents, who have been buying more and more assets as the price falls, now 
find themselves holding a huge inventory of assets which are worth very little. 
When these agents try to cash in their assets, they will find themselves very 
much poorer. Surely then the minority group has actually lost? 

This paradox has sparked much discussion as to the suitability of the MG as 
the basis of a microscopic model of financial markets. Consequently a diverse 
range of modified (yet MG-related) models have emerged, all seeking to address 
this point in different ways Q] . Some models include a combination of different 
types of agents - in particular, agents who are rewarded for trading in the 
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minority, and agents who are rewarded oppositely for trading in the majority 
(see Ref. [0 for an early example). In such models, the overall class of the 
game (minority or majority) is ambiguous, and is likely to change as differing 
numbers of each group flood in and out of the market. Other models try to 
combine the two different reward structures, thus requiring only one class of 
agents who strive to be neither exclusively in the minority group nor exclusively 
in the majority group j^, [| We refer the reader to the Econophysics website 
j| for further examples (see also Refs. ||, ||, pd||). 

The purpose of this paper is to consolidate the search for a suitable micro- 
scopic reward scheme for use in agent-based financial markets. The paper com- 
prises a synthesis of several earlier reports, each aimed at addressing the topic 
of self-consistent agent-based market modelling within a binary game structure. 
These earlier reports are re-organized in light of several recent papers on specific 
reward structures, and supplemented by additional simulation results, observa- 
tions and thoughts 0. We start by discussing various strategy reward schemes 
in which the agents seek to maximize their wealth in a direct way. We then 
illustrate a disturbing feature whereby such reward structures, while appearing 
microscopically acceptable, can lead to unrealistic market dynamics (e.g. in- 
stabilities). Finally, we discuss various mechanisms which are responsible for 
re-stabilizing the market in reality. This discussion leads to a 'toolbox' of pro- 
cesses from which, we believe, successful market models can be constructed in 
the future. 

2 Binary game structure 

Consider the situation of a population of N agents with some limited global 
resource. A specific example could be Arthur's famous bar problem where 
there are N regular bar-goers but only L < N seats. The reward scheme in 
Arthur's bar problem is simple: bar-goers are successful if they attend and 
they manage to obtain a seat. This reward scheme implicitly assumes that 
the bar-goers value sitting down above other criteria. While this makes sense 
for many situations, it is not universal. For example, it is our experience that 
customers of college-based bars do not view seating as a necessary requirement 
for a successful evening! The motto 'the more the merrier' often seems more 
appropriate. The MG represents a very special case of the bar problem, in which 
the seating capacity L = (N — I)/2 and all the attendees wish to sit down. In 
more general and realistic situations, the correct reward scheme is likely to be 
less simple than either Arthur's bar model or Challet and Zhang's MG. Until a 
specific reward scheme is defined, the bar model remains ill-specified. Putting 
this another way, the precise reward scheme chosen is a fundamental property of 
the resulting system and directly determines the resulting dynamical evolution. 
Hence it is crucial to understand the effect that different microscopic reward 
schemes have on the macroscopic market dynamics. 

As first discussed in Ref. jij, the bar problem is somewhat analogous to a 
financial market where the bar-goers are replaced by traders. In the same way 
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that a general bar problem requires a non-trivial reward scheme, any 'correct' 
agent-based market model will need a non-trivial reward scheme in order to 
avoid inconsistencies with financial market microstructure. This provides us 
with the motivation to look beyond the bar model and MG, in order to design 
an agent-based model with financial relevance. Having said this, the binary 
structure of the MG provides an appealing framework for formulating a market 
model without introducing too many obvious pathologies. Given these consider- 
ations, our discussion from here on will not assume any specific reward scheme, 
but will employ a binary game structure based on Ref . S . 

We start by reviewing the binary game structure. There are three basic 
parameters; the number of agents N, the 'memory' of the agents to, the number 
of strategies held by each agent s. The agents all observe a common binary 
source of information, but only remember the previous to bits. Hence the global 
information available to each agent at time t is given by \i [t] where in decimal 
notation fi[t] € {0 ... P — 1} with P = 2 m . Each strategy a R contains as its 
elements a R , and represents a response { — 1,1} to each of the possible P values 
of the global information /i. There are hence 2 P possible strategies. The agents 
are randomly allocated a subset of s of these strategies at the outset of the 
game, and are not allowed to replace these. The agents also keep a score Sr 
reflecting their strategies' previous successes, whether the strategy is played or 
not. The net action of the agents A [t] is defined as: 

A[t] = ^2n R [t]af 

R=l 

where ur is the number of agents playing strategy R. The agents always play 
their highest performing strategy, i.e. the strategy with the highest Sr. The 
strategy score is updated with payoff gR such that Sr [t + 1] = Sr [t] +gR [t + 1] . 

The central question is the following: what should we take as 'success' in 
the context of a market and hence what are the payoffs? In short, what is 
the microscopic reward scheme? This is equivalent to asking what the game 
actually is, since a game requires a definition of the players' goals and hence 
their rewards. In order to address this question, we start by considering a binary 
version of Arthur's bar problem with general L, i.e. an MG-like model which is 
generalized to L =/= (N — l)/2 jl^, ^3). The two responses would be 'attend' or 
'don't attend', and there are two possible scenarios: (i) the attendance is less 
than the seating capacity. In this case, the attendees are successful while those 
who didn't attend are unsuccessful, (ii) The attendance exceeds the seating 
capacity. In this case, the attendees are unsuccessful while those who didn't 
attend are successful. Setting L — (N — l)/2 yields the MG, and the two 
scenarios collapse into one: the successful agents are those who predict the 
minority group, i.e. attend when the minority attend, or don't attend when the 
majority attend. [For a study of the binary game with L ^ (N — l)/2, see Ref. 
If [if].] In the specific case of the MG, success is classified as having correctly 
predicted the minority outcome. The payoff function for the MG is thus of the 
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form: 



9R [t + l]=X 



A[t] 



(1) 



where x [ x \ IS an odd, increasing function of x usually chosen to be either 
X [x] = x or x [ x ] — s g n [ x ] ■ The feedback in the MG arises through the global 
information /i [t] where the most recent bit of the binary information is defined 
by the sign of the net action of the agents A [t] : 



fi [t + 1] = 2/i [i]-PH 



r , P 



(2) 



where H [x] is the Heaviside function Jl4| . In the more general form of the game 
where the resource level L ^ [N — l)/2, the argument in the Heaviside function 
H [. . . ] would depend on the value of L [Q. More generally it could become a 
complicated function of all the game parameters, or include feedback from the 
macroscopic dynamics in the past, or even the effects of exogenous news arrival. 
It is now accepted that a generalization of the MG in which agents can sit 



out of the game (i.e. 



= 0) when they are not confident of the success of 



any of their s strategies, provides a better model of financial trading since it 
allows the number of agents active in the market to vary. This more general 
model is the Grand Canonical Minority Game (GCMG) which was first studied 
in Ref. jll). The GCMG typically has two more parameters than the MG: 
namely T which is a time horizon over which strategy scores are forgotten (i.e. 
Sr [t+l] = (1 - l/T) S R [t] + g R [t + 1]) and r which is a threshold value of 
Sr below which the agents will opt not to participate in the game (a M '*l = 0). 
The GCMG seems to reproduce the stylized statistical features of a financial 
time-series over a wide parameter range (see the discussion in Ref. 



3 The price formation process 

It is commonly believed that in a financial market, excess demand (i.e. the 
difference between the number of assets offered and the number sought by the 
agents) exerts a force on the price of the asset. Furthermore it is believed that a 
positive excess demand will force the price up and a negative demand will force 
the price down. A reasonable suggestion for the price formation process could 
then be |§: 

ln[p[t]]-ln[p[t-l]] = ^P (3) 

or 

p[t]-p[t-l] = ^p- (4) 

where D [t~] represents the excess demand in the market prior to time t, when 
the new price p [t] is set and the buy/sell orders are executed. The scale param- 
eter A represents the 'market depth' (or liquidity) i.e. how sensitive a market 
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is to an order imbalance. In general we would expect A to be some increasing 
function of the number of traders N trading in that asset. Also, the functional 
forms may not be linear as in Equations |^ and ^. However, studies carried out 
on several markets have found the forms of Equations [| and |4| to be reasonable 

In Ref. 14], we suggested that in markets involving a market-maker who 
takes up the imbalance of orders between buyers and sellers, the market-maker 
will wish to manipulate the price of the asset in order to minimize her inventory 
of stock ()>m [t]- Whilst the market-maker's job is to execute as many of the 
orders as is possible, she does not wish to accumulate a high position in either 
direction. To this end, the market-maker lowers the price to attract buyers 
when she holds a long position <j>M > and raises it to attract profit-taking 
sellers when she has a short position <j>M < 0. This modifies the price formation 
of Equation || tcfjj: 

tolP[t]]-U[P[t-l]] = ^-+Zp. (5) 

A Am 

where Am is the market-maker's sensitivity to her inventory. 



4 From MG to market model 

Our puzzle is to form a sensible model of a collection of traders buying and 
selling a financial asset. It is clear from Sections || and || that we have two 
pieces of the puzzle. The MG gave us a simple paradigm for a group of agents 
interacting in response to global information and adapting their behavior based 
on past experience. The output of this model was a 'net action' A\t\. In 
addition, we now have some simple systems for price-formation based on an 
excess demand D [t] . But what relationship should we assume between D [t] and 
To answer this question, we should clarify what we want the individual 
actions a^f*' of the strategies to resemble. In short, the question is 'what does 
a strategy map from and to?' 

In most studies of the MG as a market model, researchers have taken the 
actions Uj S {—1,1} to be the actions of selling and buying a quanta of the 
asset respectively. Thus the net action of all agents A [t] naturally becomes the 
excess demand, i.e. we have: 

D[t-]=A[t-l] (6) 

Given this and the fact that the price change is an increasing function of excess 
demand (for Equations || and ^) , the global information becomes the directions 
(up or down) of the previous price changes. So the answer here is that a strategy 
maps from the history of past asset price movements to a buy/sell signal. A 
strategy mapping from the history of past asset price movements seems a natural 

1 It is important to note that this price process is not abitrage-free i.e. it is possible for the 
agents to make profit by manipulating the market maker. 
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and sensible choice, since financial chartists do look at precisely this information 
to decide upon a course of action. We therefore propose that in general, for this 
class of market model, we should always use the following Equation (Eq. [?]) to 
update the global information, irrespective of whether Equation ^ holds. 



[i[t+l] =2n[t]-PK 



r i P 
MM- ¥ 



H [AP [t + l,t]] 



(7) 



It is worth now spending a few moments considering the timings in this 
model. Equation ^ is for the excess demand at time t~, i.e. prior to time t 
when all the orders are processed and a new price p [t] is formed. Thus D [t~] 
can only result from all the information that is available at time t — 1, i.e. 
fi [t — 1] and {Sr [t — 1]}. From this information, the agents take actions a^ 4-1 ' 
producing a net action of A [t — 1]. The agents' actions (orders) however do not 
get realized (executed) until time t when the new price p [t] is known. The MG 
payoff function of Equation |], hence rewards agents positively for deciding to 
sell (buy) assets (i.e. a^* -1 l = —1 (1)) when the order execution price p[t] is 
above (below) the price level at the time they made the decision (i.e. p [t — 1.]). 

Why is this mechanism of moving in the minority a physically reasonable 
ambition for the agents? Let us consider the 'notional' wealth of an agent i to 
be given by: 



Wi[t} = <f>i [t}p[t} + a 



(8) 



where fa is the number of assets held and Ci is the amount of cash held. It 
is clear from Equation || that an exchange of cash for asset at any price does 
not in any way affect the agents' notional wealth. However, the point is in the 
terminology The wealth Wi [t] is only notional and not real in any sense - the 
only real measure of wealth is Ci which is the amount of capital the agent has 
available to spend. Thus it is evident that an agent has to do a 'round trip' (i.e. 
buy (sell) an asset then sell (buy) it back) to discover whether a real profit has 
been made. Let us consider two examples of such a round trip. In the first case 
the agent trades in the minority while in the second he trades in the majority: 

• Moving in minority: 



t 


Action a [t] 


Q [t] 




Pit] 


Wi[t] 


1 


Submit buy order 


100 





10 


100 


2 


Buy..., Submit sell order 


91 


l 


9 


100 


3 


Sell 


101 





10 


101 



Moving in majority: 



t 


Action a [t] 


Ci [t] 




p[t] 


Wi[t] 


1 


Submit buy order 


100 





10 


100 


2 


Buy..., Submit sell order 


89 


1 


11 


100 


3 


Sell 


99 





10 


99 



As can be seen, moving in the minority creates wealth for the agent upon com- 
pletion of the necessary round-trip, whereas moving in the majority loses wealth. 
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However if the agent had held the asset for any length of time between buying 
it and selling it back, his wealth would also depend on the rise and fall of the 
asset price over the holding period. Thus the MG mechanism seems perfectly 
reasonable for a collection of traders who simply buy/sell on one timestep and 
sell/buy back on the next, but this is not of course what real financial traders 
do in general. This pinpoints the main criticism of the MG as a market model. 



5 Modified payoffs 

The minority mechanism at play in the MG, arises solely from Equation [I] for 
the payoff (reward) gji [t + 1] given to each strategy R based on its action a R . 
Thus it can be trivially conjectured that changing the structure of the payoff 
function will change the class of game being played. In Section |4| we pointed 
out that although trading in the minority was beneficial, the minority payoff 
structure itself made no allowance for the rise or fall in the value of the agent's 
portfolio (pi of assets. Let us try to rectify this by examining the form of the 
agent's notional wealth, Equation |[ If we differentiate the notional wealth, we 
get an expression for AW t [t+l,t] = Wi [t + 1] — Wi [t] given by: 

AW t [t + l,t] = AC, [t + l,t]+p[t + 1] A<& [t + 1, t] + (f>i [t] Ap [t + 1, t] 

The first two terms cancel because the amount of cash lost —ACi [t + l,t] is 
used to buy the extra A(f>i [t + 1, t] assets at price p [t + 1]. This leaves us with: 

AWi[t+l,t]=<p i [t}Ap[t + l,t] (9) 

We can then use Equation^] to work out an appropriate reward Qr [t + 1] for each 
strategy based on whether its action a R induced a positive or negative increase 
in notional wealth. Let us first use the fact that the price change Ap [t + l,t] 

is roughly proportional to the excess demand D (t + 1)~ : this can be seen 
explicitly from our earlier equation for the price formation, Equation |4| Then 
let's use our MG interpretation of the net action A[t], Equation ||. We thus 
have from Equation [|: 

AW R [t + l,t]<*4> R [t]X)[(t + l) 
a fl [t] A [t] 

We then identify the accumulated position of a strategy at time t, <f>n [t], to 
be the sum of all the actions (orders) made by that strategy which have been 
executed within times . . . t. Remembering that at time the action (order) 
a R ^ has not yet been executed (it gets executed at t + 1), this gives <f)R [t] — 

Si=o a R^ ■ us then set the payoff given to a strategy gn [t + I], to be an 
increasing (odd) function \ of the notional wealth increase AIVr [t + 1,<] for 
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that strategy. We thus arrive at: 



9R [t + 1] = X 



A[t] 



(10) 



We could also propose a locally-weighted equivalent of Equation [h], in which 
the reward given to a strategy is more heavily weighted on the result of its recent 
actions than the actions it made further in the past. This gives 



9R [t + 1] = X 



t-i 

Ed 



l/Tf-^af 1 xA[t] 



(11) 



where T enumerates a characteristic time-scale over which the position accu- 
mulated by the strategy is 'forgotten'. The T = 1 limit of Equation [ll] yields 



9R [t + 1] = X 



,/*[*- 



1] A[t] 



, i.e. only the position resulting from the most re- 
cently executed trade is taken into account. With T = 1, this payoff structure 
essentially rewards a strategy at time t+1 based on whether the notional wealth 
change AW^ [t + 1, t] was more positive than it would have been if action a^f* 
had not been taken. 
If Equation 
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is used in an agent-based market model, the agents play 
the strategy they hold which has accumulated the highest 'virtual' notional 
wealth. We mean 'virtual' in the sense that the strategy itself will not have 
actually accumulated this notional wealth unless it has been played without fail 
since time t = 0. The agents in this model are thus all striving to increase their 
notional wealth and are allowed to do so by taking arbitrarily large positions fa . 
We will call this incarnation of the market model $G|TT|: the term '$G' is 
named after the 'The $-Game' of Ref. || where agents also strive to maximize 
notional wealth, and '[n]' is named after Equation |ll| for the strategy payoff. 
We stress that this incarnation of the market model is by no means unique. 
For example, it may be more appropriate to consider a model wherein each 
agent is only allowed one position in the asset at any time, i.e. fa [t] = — 1, 0, 1. 
This amounts to us redefining the strategies R as mapping to the actions 'go 



long/short' instead of 'buy/sell', i.e. 



to (t + 1) here because the action a^f*' is not executed until time t + 1: after 



(t+iy 



Note the reference 



this time a 



/'['] 

R 



(>r [(t + 1)] . In this new model, the net action of the agents 



A [t] therefore represents the overall desired position of the population at time 



(t+1) , i.e. A[t] = 
demand is given by: 

D 



N 



(t+iy 



It therefore follows that the excess 



N 

(t+ir] =E(^ [(* +i r] 



A[t]-A[t- 1} 



(12) 
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Similarly, for this system, the payoff to strategies based on the notional wealth 
increase is given by Equation M as: 



g R [t + 1] oc x [fa [t]Ap[t + l,t]] 

(A [t]-A[t-l]) 



(13) 



We will call this incarnation of the market model $G|l3|. The payoff 
structure for this one-position-per-agent system is essentially the same as that 
reported in Equation [3] of 'The $-Game' 0. 



6 Behavior of market models 

In the previous section, we introduced different microscopic reward schemes for 
use in binary multi-agent market models. The aim of these new strategy payoff 
structures, was to get the agents to maximize their notional wealth as given 
by Equation ||. This seems to be a more physically realistic goal than to have 
the agents competing to always be in the minority group, as in the MG and its 
variants. The question now must be: how do these new payoff structures affect 
the resultant dynamics of the market model? To investigate this, we identify 
the form of the excess demand in the model and then employ a price formation 
structure - from Section ^ for example. The table below illustrates three possible 
market models: 



Name 


4>R [t] 


D (i+1) 


g R [i + 1] =sgn[...] 


AP [t + l,t] 


GCMG 


to\t-'A+af- 1] 


A[t] 


-af ] D 






D 


(*+!)" 


/A 


$G[n 




to\t-2]+af- 1] 


A[t] 




l] D (t + 1) - 


D 




A 


$G[13 




„/*[*-!] 

U R 


A[t] — A[t — 1] 


af^D 


(* + !)" 




D 


(t+iy 


A 



We have simulated the three market models in the above table with the 
same parameters for each model: N = 501 agents, s — 2 strategies per agent, 
T = 100 timesteps for the strategy score time-horizon, and an r — 4 payoff 
point confidence-to-play threshold. Each simulation was run for agents with 
low memory m = 3 and for agents with high memory m = 10. The resultant 

price p [t] and volume v [t] — Y1r=i n R are displayed in Figure |1|. 

The two different values of the agent's memory length (or rather the ratio 
P/N) describe markets in two distinctly different 'phases'. Arguably the most 
important feature of these types of model is that there is a common perception 
of a strategy's (virtual) success, i.e. Sr [t] is a global variable. This implies 
that the number of agents adopting the same action (i.e. the effect of crowd- 
ing or herding) is dependent on the number which hold each strategy. At low 
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Figure 1: Behaviour of price and volume in market models corresponding to 
three different strategypayoff functions [t] . Parameters for each model are 
N = 501, s = 2, T = 100, r = 4. The left hand column represents the crowded 
market with high agent coordination (m = 3) the right hand column represents 
an uncrowded market with little agent coordination (m = 10). The top row 
shows the GCMG, the second row shows the $G|IT| and the bottom row shows 
$GjD§ 
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memory (low P/N) there are few strategies available and many agents, conse- 
quently large numbers of agents hold the same strategy. This implies that in 
this 'crowded phase' there are large groups of agents adopting the same action 
Op (i.e. crowds) and only small groups adopting the opposite action —a^ 

18 . The result of this high agent coordination is a volatile 
market with large asset price movements. Conversely, for high m (high P/N) 
hardly any agents hold the same strategy. Hence the crowd size is roughly equal 
to the anti-crowd size, resulting in a market with low agent coordination and 
consequently lower volatility and fewer large changes. With this distinction be- 
tween low and high to regimes in mind, let us briefly describe the dynamics in 
each market model. 



(i.e. anti-crowds) 



• GCMG. First consider low to. The confidence-to-play threshold is set 
high enough in order that occasional stochasticity is injected via timesteps 
when A [t] = (N.B. the next state of /x [t + 1] is then decided with a coin 
toss). The GCMG can be seen to reproduce many of the statistical features 
observed in real markets. Figure j] shows that the activity (volume) is 
generally low and bursty. The asset price series is thus characterized by 
frequent large movements (giving fat-tailed distributions of returns) and 
clustered volatility. No long periods of correlation exist in the GCMG asset 
price movement: as soon as lots of agents start taking the same action, 
then the strategies which produce that action are penalized, as can be seen 
from Equation [[} At high to, the absence of agent co-ordination leads to 
an absence of clustering of activity. Hence the series of asset prices appears 
more random at high m. 

• $G |ll| . At low to, the asset price very soon assumes an unstoppable 
trend: agents with at least one trend-following strategy (. . . A [t — 2] > 
0, A [t - 1] > 0, A [t] > => an = 1) join the trend and benefit (no- 
tionally) from the consequent asset price movement. Because the strate- 
gies (and hence agents) are allowed to accumulate limitless positions, the 
trend is self-reinforcing since APFr just keeps getting bigger for the trend- 
following strategies. At high m, the lack of agent coordination means that 
it is harder for the model to find this attractor. However sooner or later, 
a majority of trend-following strategies will have accumulated sufficient 
score Sr and position \<j>n\ to be played successfully. From then on, the 
pattern of success is self-reinforcing and again the unstoppable trend is 
created. This result has to be seen as the natural consequence of wanting 
the agents to maximize notional wealth at the same time as being allowed 
arbitrarily large positions. 



$G13. At low to, the model has interesting and irregular dynamical prop- 
erties. For example, the series of asset price movements contains periods 
of very high correlation (and high volume) and periods of very high anti- 
correlation (and lower volume). These two behaviors arise from the payoff 
structure of Equation |D| just as in $Gp"l|, strategies are rewarded for 
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having a positive (negative) position when the asset price is trending up 
(down), however they now get penalized for joining that trend. This im- 
plies that persistent behavior will tend to follow persistent behavior, while 
anti-persistent behavior will follow antipersistent behavior. This finding 
is very much in line with the school of thought that says that markets 
have distinct ranging and breakout phases. Unlike the $G|Tl| model, the 
trends are always capped by the limitation that agents can only hold one 
position. Once as many agents as possible are long (short) the asset price 
cannot rise (fall) any further: consequently the resulting dynamics are 
strongly mean-reverting. At high m, due to the lack of agent coordina- 
tion, it is hard for large enough crowds to arise to form trending periods. 
Thus anti-persistence dominates this regime. 

7 Instability in market models 

In the MG, there is no unstable attractor which could give rise to a continuously 
diverging price. The reason is that each agent is trying to do the opposite of the 
others. This results in no net long-term market force. The MG agents can see 
no advantage in collectively forcing a price up (down) to increase the value of 
their long (short) position, since they are totally unaware of their accumulated 
position. Section || showed however that, when we give the agents the goal of 
maximizing their notional wealth (cash plus value of position), the model can be 
dominated by the unstable dynamics of price trending. Although the structure 
of the $G models seems more realistic than the MG, the price dynamics do not 
reproduce many of the stylized features of financial asset price series. It seems 
that an improvement in the microscopic model structure 'spoils' the macroscopic 
results. To help explain this, we now look at some of the model deficiencies which 
still remain. 

7.1 Market impact 

We hinted at this effect in Section [| The wealth Wi [t] that we have forced the 
agents of the $G models to maximize, is purely notional - it is the wealth they 
would have if they could unwind their market positions at today's price p[t]. 
The agents are clearly not able to do this. There are two reasons for this. Even 
if an agent put in an order to unwind his position at time t, it would not be 
executed until time t + 1 when the price is p [t + 1]. The second reason is that 
in this simplistic structure, the agents can only trade one quanta of asset at 
any time. Hence a position of 4>i assets will take <pi timesteps to fully unwind. 
All other things being equal, an agent should expect on average to get less cash 
back than ipi [t]p[t] when unwinding his position. This is 'market impact'. We 
can see this effect directly with the following argument: 

The change in cash of agent i from buying A<^ [t, t—1] assets at time t is given 
by: ACi [t, t — 1] = — A4>% [t, t — l]p[t]. Let us then use Equation || to give 



13 



p [t] = p [t — 1] + D [t ] /A. Then we break down the demand as follows: 



JV 



D[t-]=di + j2 d i [*"] 

where dj is the demand of agent j . However we know that the demand of agent 
i is Acj)i [t, t — 1] assets, hence we get 

ACj[M-l]=-A^i [t,t-l]p[t] 

1 1 N 

= [t, t - l]p [t - 1} - - (A& [t, t - l]) 2 - [t, t - 1} dj 

If we now assume that the remaining agents j j^i form an effective 'background' 
of random demands dj of zero mean, then averaging over this 'mean-field' of 
agents gives: 

(AC, [t, t - = —A<j>i [ t ,t-l]p[t-l]-j (A4>i [t, t~l]f (14) 

Equation |m| tells us that an agent will on average receive an amount of cash 
(Ad [t,t— 1]}, which is equal to the amount he may have naively thought he 
would get (if he could have traded at price p [t — 1]) minus a positive amount 
proportional to the square of his trade size and inversely proportional to the 
market liquidity. Consequently, an agent who considered his market impact 
in calculating his wealth would on average expect the cash-equivalent value of 
his wealth to be given by Wi [t] — 4>i [t] 2 / A, i.e. his notional wealth minus the 
average impact from unwinding his position at time t + 1. This is the wealth 
that the agents should be attempting to maximize. We could therefore propose, 
in the spirit of Ref. [|l9| , a modification to the strategy reward function gp> [t] 
which takes account of this market impact: 

g R [t + 1] oc x [fo [t] Ap [t + 1, t] - <i> R [tf /A] (15) 

Let us now re- visit the issue of trend following, this time with reference to a 
model whose strategies are updated using Equation [l5|. We will refer to this 
model as $G|l5[ When a trend forms such that 4>R [t] and Ap [t + l,t] have 
the same sign, strategy R will be rewarded: this continues until the moment 
when the extra profit gained from increasing the position to benefit from the 
favorable price movement, is smaller than the added market impact which would 
be faced from unwinding the larger position. At this point the strategy will 
start to be penalized instead of rewarded for supporting the trend. This is a 
mechanism which could halt the price-divergent behavior of $G[TT| type models. 
However Equation [l5| gives the turnover point in strategy reward due to market 
impact, to be given by 4>r [t] — XAP [t + 1, t\. If we then consider that during 
an 'unstable' market trend the average price change is of order N/X (see, for 
example, the gradient of the $G[ll] price in Figure - and that (j>R increments 
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by only ±1 every timestep - then trends which are formed before the market 
impact mechanism starts having a strong effect, will be of order itrend ^ N 
timesteps in length. The market impact mechanism on its own, is thus clearly 
too weak to stabilize the $G models in order to form a realistic price series. 

7.2 Agent wealth 

Financial agents participating in a real market have finite resources and so 
cannot keep buying and/or selling assets indefinitely This hard cut-off of agents' 
resources in turn imposes a hard limit on the magnitude of price trends. This 
effect was seen clearly in the $G|l3| model where the agents were only permitted 
(or, equivalently because of finite resources, were only able to) hold one position 
at any one time. The $Gp"3| and $Gp"T| models can thus be thought of as opposite 
extremes in this respect, i.e. the first mimics extremely tightly- limited resources 
while the second mimics infinite resources. It seems natural to expect that 
a real financial market lies somewhere between these two extremes; resources 
are large for the market-moving agents but still finite. We can include the 
effect of limited agent resources in our market model by allocating agent i an 
initial capital Cj [0] (and position <f>i |0]) and then updating this capital using 
Ci [t] = Ci [t — 1] — Atfii [t,t — l]p[t] B. The agents cannot then trade at time 
t if the following conditions are met: 

Afa [t, t - 1] > o n Q [t - 1] < A& [t, t-l]p[t-l] 

Afc [t, t - 1] < o n & [t - 1] < o 

In other words, an agent will not submit a buy order unless he at least has the 
capital to buy the asset at the quoted price p [t — 1] , and also will not submit an 
order to short sell if he already holds a short position. If we imposed no limit 
on short selling, an unstable state of the system would exist wherein all agents 
short-sell indefinitely. 

We will refer to this model as $G|Tl]W where W represents 'wealth'. 
Let us initialize this market model with agent wealths such that the agents' 
initial buying power is equal to their initial selling power, i.e. {Ci [0] , <pi [0]} = 
{n X p [0] , n — 1} . Hence initially, the agents have the power to buy or sell n 
assets (N.B. recall that they are allowed up to one short sell). If we then raise n 
from n = 1 upwards, we can see a qualitative change in the dynamical behavior 
of the model between the two extremes of $Gp3| and $Gpd|, with the periods of 
trending growing longer as n increases. To investigate this behavior numerically, 
we fix n and run the model for 1500 timesteps, recording the value of the global 
information /i [t] in the last 500 timesteps t. We then count the number of 
times within this last 500 timesteps that either fi = (implying negative price 
movements over the last m timesteps) or /i = P — 1 (positive price movements) 
occur. We denote the frequency with which these states occur as /trend- If 
the model visited all states of the global information [i [t] equally, we would 
expect /trend = Figure |^, showing the variation of /trend with n, has 

several interesting features. First it can be seen that, as n is increased and the 
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Figure 2: The frequency of occurence of the global information states fi = 0, P — 
1, /trend, as a function of the agents' capital resource level n. The results were 
taken from the last 500 timesteps of a 1500 timestep simulation. The market 
model used was $Gp"l|W, with evolving agent wealths. The model parameters 
were N = 501, m — 3, s = 2, T — 100, and r = 4, with binary (x = sgn) 
strategy rewards. The solid line represents equally visited /^-states. 



consequent wealth available to the agents grows, the tendency of the model to be 
dominated by price trending increases dramatically. Also we see that at low n, 
/trend is below that of the (random) equally visited /x-states case. This is due to 
the high degree of anti-persistence in the system as was seen in the $G[i~l| model. 
The large spread in the results arises from the tendency of the model to exhibit 
clustering of activity states: persistence follows persistence and anti-persistence 
follows antipersistence as described in Section [| 

We note that in the work of Giardina and Bouchaud (see Refs. [[| El| ) 
the strategy payoff function has the same basic form as Eq. |lj, i.e. it is a 
$Gp"3|W- style model. These authors consider proportional scoring (i.e. x = 1) 
and include an interest rate which resembles a resource level L; these features 
will not change the qualitative results presented here. However, in the models of 
Refs. |], it seems that only active strategies are rewarded thereby breaking 
the common perception of a given strategy's success among agents which is 
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arguably a central feature of the binary model structure. We expect this to 
dramatically affect the dynamical properties of the system as the mechanism 
for agent coordination and thus crowd formation has been altered. 

7.3 Diversity and timescales 

The interesting dynamical behavior of market models based on the binary strat- 
egy approach of the MG, arises largely as a consequence of the heterogeneity in 
the strategies held by the agents and the common perception of a strategy's suc- 
cess. The extent to which agents will agree on the best course of action, and the 
consequent crowding, is solely a function of how the 2 P available strategies are 
distributed amongst the population of agents. If the way in which the strategies 
are initially allocated is biased, then the resulting dynamics will reflect this ||. 

The introduction of an agent wealth, as discussed in the previous subsection, 
also brings about diversity in the market model. Even if we initiate the model 
with all agents having an equal allocation of wealth in the form of cash plus 
assets, the wealths of the agents {W{\ will soon become heterogenous as a 
direct result of their heterogenous strategies. Figure || shows the heterogeneity 
of agents' wealth growing with time during a $G0W simulation. After many 
timesteps have elapsed, the distribution of agents' wealth seems to reach a 
steady-state: many agents have lost the majority of their wealth to a minority of 
agents who themselves have accumulated much more. In everyday terminology, 
'the rich get richer while the poor get poorer'. 

The heterogeneity of agents' wealth in the $G|Tl|W model is fed back into 
the system through the buying power of the agents alone. However, although 
wealthier agents have the potential to buy and sell more assets, they still only 
trade in single quanta of the asset at any given timestep, the same as poorer 
agents. It may be more reasonable to propose that the agents trade in sizes 
proportional to their resources j|, a, i.e. 

A^[ M -l]= 7 ^|zi! for a $-V = l 
p[t-l\ 

[t, t-l} = y<t>i [t - 1] for a^*r 1] = -1 

where R* is the highest scoring strategy R of agent i. The factor 7 then enumer- 
ates what fraction of the agents' resources (cash for buying, assets for selling) 
they will transact at any given time. In this system, assets will generally need 
to be divisible, i.e. the sense of agents buying and selling a quanta of the asset 
as in $Gp"l|W is lost. This in turn means that instead of the degree of trending 
being controlled by the level of initial resource allocation n, it is instead de- 
termined by n/7, since this effectively determines the number of trades agents 
can make in any trending period before hitting the boundary of their capital 
resources. Also with this system of trading in proportion to wealth, trends will 
start steep and end shallow as agents run out of resources and thus make smaller 
and smaller trades. Apart from these stylized differences, the system has a very 
similar dynamical behavior to the more straight-forward $Gp"T|W model. 
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Figure 3: Distributions of the agents' wealth Wi at four different times t during 
evolution of a $G[Tl|W model. Initially all agents were allocated the same re- 
sources {Ci [0] , 4>i [0]} = {3p [0] , 2} with the initial price p [0] = 10. Parameters 
for the simulation were N = 1001, m — 3, s — 2, T = 100, and r = 4, with 
binary (x = sgn) strategy rewards. 
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Figure 4: Examples of the dynamical behavior of the price p [t] and volume 
J2iLi M»l 01 the SGpl|W model, for three different degrees of crowding given by 
the memory length m. The parameters of the simulation are N — 501, s = 2, 
T = 100, r = 4, and n = 3, with binary [\ = sgn) strategy rewards. 



Diversity in strategies and wealth are the two big sources of agent hetero- 
geneity that we have covered so far. This agent heterogeneity has led to a 
market model with dynamical behavior which is interesting and diverse over a 
large parameter range. However, the typical price/volume output only starts 
to become representative of a real financial market at higher m, as can be seen 
in Figure ||. As discussed in Section ^, the high m regime represents a 'dilute' 
market where very few traders act in a coordinated fashion. However one would 
expect that a real financial market is not in a dilute phase at all, since it does 
have large groups of agents forming crowds which rush to the market together, 
thereby creating the bursty activity pattern observed empirically. Why then 
does this model, when pushed into the low m regime, produce endless bubbles 
of positive and negative speculation as shown in the left-hand panel of Fig. [|. 
Although a real financial market does show some suggestion of oscillatory bub- 
ble formation, it is nowhere near so pronounced and is certainly not on such a 
short timescaleLl 

To answer this question, we push further the subject of agent diversity. 
Although our agents may have differing sets of strategies and consequently dif- 
ferent wealths, they all act on the same timescale. When we look at charts 
such as Figure ||, we see patterns not only on a small tick-by-tick scale but also 
on a much larger scale. In fact we can identify patterns all the way up to the 

2 We assume implicitly here that a 'timestep' in our model corresponds to a fairly short 
interval in real time. The reason for this is that a timestep is the amount of time it takes for 
an agent to re-assess his strategies. For large, market moving, agents this amount of time is 
likely to be less than one day. 
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'macro' scale of the boom-bust speculatory bubbles. From a knowledge of these 
patterns, we would form opinions about what will happen next and would trade 
accordingly to maximize our wealth. The agents we have modelled, however, 
cannot view the past price-series in this way; instead they are forced to only 
consider patterns of length m timesteps. Patterns of any length greater than 
this, go un-noticed by the agents and hence are not traded upon. This explains 
why these patterns can arise and survive. By contrast, when a pattern is traded 
upon (even in the non-MG framework) it is slowly removed from the market. 
Consider the following pattern: 

time t 1 2 3 4 5 6 7 8 9 10 11 12 13 14 
price p[t] 14 10 11 12 13 15 14 10 11 12 13 15 14 10 

If we were able to identify this pattern repeating, our best course of action 
would be to submit a buy order between times t = 1 and t = 2, i.e. a [1] = 1. 
We could then buy the asset at p [2] = 10. Then between times t = 5 and 6, we 
would submit an order to sell: a [5] = —1 and hence would have sold the asset 
at p [6] = 15. We then continue: a [7] = 1, a [11] = — 1... This ensures we always 
buy at the bottom price and sell at the top. However, trading in this way is 
against the trend as a [t] A [t] < 0. It is in effect minority trading. Hence trading 
to maximize our wealth with respect to this pattern, leads to the weakening of 
the pattern itself just as in the MG. We conclude therefore that the presence 
of strong patterns in the $Gpd|W and other similar market models at low m, is 
simply due to the absence of agents within the model who can identify these 
patterns and arbitrage them out. 

From the proceeding discussion, it is clear that a realistic market model 
should include agents who can analyze the past series of asset price movements 
over different time-scales. A first thought on how to do this, is to include a 
heterogeneity in the memory length m of the agents. In this framework, agents 
look at patterns not only of differing length but also of differing complexity. It 
is therefore appealing to propose a generalization to the way in which the agents 
interpret the global information of past price movements, in such a way as to 
allow observation of patterns over different times-scales but having the same 
complexity (m). This can be simply achieved by allowing the agents to have a 
natural information bit-length r such that the global information available to 
them \i r [t] is updated according to the following generalization of Equation ^ 



The table below then shows how the previous pattern would be encoded by 
agents having r = 1, 2, 3, 4: 



Hr [t + 1] = 2/i r [t] - PR fj, T [t] - - + H [AP [t + l,t + 1 - t}] 
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time t 


1 


2 


3 


4 


5 


6 


7 


8 


9 


10 


11 


12 


13 


14 


price p [t] 


14 


10 


11 


12 


13 


15 


14 


10 


11 


12 


13 


15 


14 


10 


'best' action a* [t] 


1 


1 


1 


1 


-1 


-1 


1 


1 


1 


1 


-1 


-1 


1 


1 


sgn[AP [t,t-l]] 






+ 


+ 


+ 


+ 






+ 


+ 


+ 


+ 






sgn [AP [t,t-2]} 








+ 


+ 


+ 


+ 






+ 


+ 


+ 


+ 




sgn [AP [t,f-3]] 










+ 


+ 


+ 








+ 


+ 


+ 




sgn[AP [t,t-4]] 












+ 


+ 










+ 


+ 





If each agent only considered the past two bits of information, i.e. m = 2, 
then the 'best' strategies for different values of the information bit-length r 
would be as shown in the table below. These 'best' strategies have been obtained 
from inspection of the table above; in particular, by looking at when the different 

m = 2 bitstrings { ,—+,-{ — ,++} occur and seeing the respective 'best' 

action given this bit string a* [t] . 



T 



n[t] 





1 


2 


3 


4 




1 


1 


1 


? 


-+ 


1 


1 


-1 


-1 


+- 


1 


1 


1 


1 


++ 


? 


? 


? 


1 



The 'best' strategies (a R .) in the above table, show a question mark (?) 
next to a particular value of the global information /i [t] . This denotes that for 
this state, the best action is sometimes a R ^ = 1 and sometimes affl = — 1. 
It can thus be seen that only when we include longer time-scale patterns r > 
2, do we get a clear signal of when is the optimal time to sell = — 1). 

Shorter timeframe patterns provide no clear indication when to do soQ. This 
then demonstrates that an agent holding strategies of different bit-length r could 
identify optimal times to buy and sell, and hence arbitrage patterns of length 
very much greater than the memory length m. 

7.4 Exogenous information 

So far this section has discussed how instabilities and inefficiencies in the model 
market (as indicated by repeating, un-arbitraged patterns) could be remedied by 
inclusion of more realistic, sophisticated and diverse strategic agents. However 
the agents we have considered have all acted in the same fashion, i.e. with the 
goal of maximizing their wealth by employing a strategic methodology based 
on the observable endogenous market indicators. In terms of a real financial 
market, we have considered modelling the subset of agents who consider the past 
history of asset price movements as the only relevant information. Although in 
reality we expect this subset to be large, we certainly acknowledge the presence 
of agents who strategically use other endogenous market indicators such as 

3 In this case a single m = 3 strategy could encode enough information to optimally arbi- 
trage the pattern. However in general, for longer patterns, this is less likely to be true. 
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volume as a source of global information. The strategic use of other endogenous 
market indicators simply gives a more sophisticated model within the same basic 
framework. 

What we have so far neglected to include is the presence of agents using 
exogenous information to influence their trading decisions. Possible sources 
of exogenous information are as vast as human imagination itself: they could 
for example range from economic indicators through to company reports and 
rumor, or even 'gut-feeling' and astrology! Within the context of our model 
framework, strategies employed based on these external sources of information 
are un-strategic with respect to the endogenous information, and as such repre- 
sent a stochastic influence. 

Such stochasticity, representing the response of agents to exogenous infor- 
mation, can be incorporated into the models in a variety of ways. For example 
a background level of stochastic action could be assumed at each timestep, or 
be made to appear as a Poisson process with a given mean frequency. How- 
ever, it seems most physically satisfying to incorporate stochastic action in the 
form of an extra pair of strategies for each (or at least some) agent: one buy 
strategy, a R = 1, and one sell strategy, a R = — 1. Agents could then be given 
a probability of using each of these strategies in place of their 'usual' chartist 
strategies. Furthermore, the probabilities of using these strategies could be al- 
lowed to evolve. This system of modelling response to an exogenous signal is 
essentially the 'Genetic Game' of Ref. p0| |. Alternatively, the probability of 
using one of these strategies representing the response to an exogenous signal, 
could be based on an endogenous market indicator. An example of this is the 
mechanism of Rcf. || of switching to 'fundamentalist behavior'. In the model 
of Ref. ||, agents use strategy a R — sgn [po — p [t\] with a probability / which 
increases with \p[t] — p$\, where po represents a perceived 'fundamental' value 
for the asset. This mechanism, which can be thought of as encoding the be- 
havior of irrational fear, clearly acts to break the formation of long trends since 
the probability of trading against the trend increases with the duration of the 
trend itself. More generally, we suspect that among the agent-based market 
models in the literature which manage to reproduce stylized facts, the inclusion 
of some level of stochasticity within the model is crucial for helping to break up 
unphysical dynamics and hence achieve market-like behavior. 

8 Strategy structure 

In the framework of the MG, each strategy a R comprises the elements a R which 
represent a response {—1,1} to each of the possible P values of the global 
information /i (see Section ^). In Section [|, we identified the different states 
of the global information /x as corresponding to different patterns in the past 
history of asset price movements (binary: up/down). We can therefore think 
of a strategy as a book of chartist principles recommending an action for each 
and every possible pattern of length m. Each of these strategy 'books' thus has 
2 m pages, one for each pattern. There are 2 2 possible books an agent can buy 
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and use as guidelines on how to trade. An agent in possession of one of these 
strategy 'books', whichever he chooses to buy, can find a page giving guidelines 
on how to trade in every possible market state. The books are thus 'complete'. 

It seems unlikely however, that in reality such a 'complete' book would exist 
for any arbitrary value of m. Indeed, even if such a book did exist, it is unlikely 
that a given market participant would consider all the pages as being true. 
For example, page one of a strategy book may say that if the asset price has 
fallen three days running, one should sell assets on the fourth day. Page three 
may say that if the asset price has fallen, then recovered and then fallen again, 
it is advisable to buy. The agent holding this book may well believe in page 
one but think that the guidelines of page three are rubbish: in particular, he 
considers the pattern down-up-down to correspond to no trading signal at all. 
This agent would therefore continue to hold any previous position he had if 
he saw the pattern down-up-down. We therefore propose a generalization to 
the strategy structure of the MG to account for the fact that strategy 'books' 
may be incomplete, or equivalently that agents do not trust some of the trading 
signals. 

Before trading commences, each agent goes through each of her (complete) 
strategy 'books' page by page, and decides whether she believes in the trading 
guidelines given for each of the possible P — 2 m market 'signals'. The agents 
choose to accept each page of each of their books with a probability p a . If the 
agent rejects a page of the strategy 'book', they replace the suggested action for 
time t, i.e. a^*' G [—1, 1], by a null action a^P = *■ When trading commences, 
the agents are then faced with deciding what course of action to take when 
their strategy 'books' register the null action at time t. One type of possible 
behavior would be to maintain their prior course of action, i.e. af ] = af~ l \ 
Another alternative behavior would be to maintain their prior position, i.e. 

tf>i (t + 1)~ = 4>i[i\. (This results in a^f 4 ' = in $GjTT| type models and 

af ] = af~ 1] in $Gg| models). 

We now examine the qualitative differences between a model with this in- 
complete strategy structure, and a model with complete strategies. For this 
purpose we will assume a GCMG type model, i.e. a model with strategy payoff 
given by Equation Figure |E| shows the resulting price and volume from two 
market simulations with incomplete strategies, the probability of non-null ac- 
tion being p a = 0.3. These are compared with the GCMG which has complete 
strategies. 

The top-left chart of Figure ^| shows price and volume for a market where 
agents maintain their previous course of action when they encounter a null 
strategy action. This market shows high volatility in price, a high number of 
very large movements, and a background of high volume trading with occasional 
sharp spikes. We suggest that this behavior arises from the tendency of agents 
to keep repeating their previous action in the absence of a new market pattern. 

4 We use the GCMG to contrast the effects of incomplete strategies, since its dynamics are 
familiar and have been well-studied. However our previous comments regarding the short- 
comings of the MG strategy payoff structure, should be kept in mind since they still hold. 
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Figure 5: The contrast between the behaviour of models with incomplete strat- 
egy 'books' to those with complete ones. The top two figures show price and 
volume for a GCMG type model with incomplete strategies, p a = 0.3. The top 
left chart represents a market of agents who maintain their previous action when 
they encounter a null strategy response a^f*' = *. The top right chart represents 
a market of agents who maintain their position. The bottom chart shows the 
behavior of the GCMG for comparison. The parameters in all the models are 
N = 501, s = 2, T = 100, and r = 4, with binary (x = sgn) strategy rewards. 
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Clearly then the 'large-movement' states of up-up-up or down-down-down will 
be occupied for longer than the 'ranging' states (up-down-up etc.) because 
these are the only states of /j, which can map onto themselves (// [t] = fi [t + 1] = 
H [t + 2] . . . ) thus providing no new market pattern. 

The top right chart of Figure || shows price and volume for a market where 
agents maintain their previous position (i.e. a^f = in this model) when they 
encounter a null strategy action. The volume in this market simulation is very 
low, giving a consequent low volatility. This is due to the fact that the agents 
will not trade unless they receive a market signal for which they have a non-null 
strategy action. This model still shows a high probability for price movements 
which are large compared to the volatility. These large movements occur in 
moments of high agent coordination, i.e. when many agents find they have a 
strategy with the same non-null action. However in contrast to the previous 
model, these large movements don't seem to persist over many timesteps. 

It is interesting to compare the statistical features of these models featur- 
ing incomplete strategies, with the features of the GCMG. Figure || shows the 
volatility and kurtosis of the price movements AP [t + l,t] for a model with 
incomplete strategies, as a function of the probability p a that a strategy has a 
non-null action for a given global information state. The market model used 
has the structure of the GCMG (N.B. for p a = 1, it is the GCMG) and features 
the behavior wherein agents maintain their previous course of action when they 
encounter a null strategy action. As a consequence, the volatility of the price 
movements rises as p a is reduced since it leads to more persistence of agent 
action and consequently bigger asset price movements. This affect can also be 
seen in the kurtosis which grows rapidly away from the Gaussian case, since the 
probability of large moves increases with falling p a . [For more details on the 
nature of large movements in the GCMG, we refer to Ref. pl[|.] 

9 Market clearing 

In the preceding sections we have discussed mechanisms through which financial 
market agents come to make trading decisions about whether to buy or sell an 
asset. In Section || we made some suggestions about how the combined action 
of all these agents' decisions produced a demand, and how that demand then 
affected the movement of the asset price. So far we have not explicitly mentioned 
a mechanism by which the market clears, i.e. by which sellers of the asset meet 
buyers. The discussion of specific clearing mechanisms amounts to a discussion 
of specific market microstructure. However our goal here is to seek to be as 
general as possible, and hence not limit the scope of our market models to any 
particular sector or style of financial market. We thus limit ourselves here to a 
simplistic discussion of market-making. 

The first important aspect of the models we have considered so far is that 
the agents make a decision about whether to buy or sell assets at time t based 
only on the information up to (but not including) time t. The orders as such 
are considered as 'market orders' because they are unconditional on the actual 
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Figure 6: The volatility and kurtosis for a market model with incomplete strate- 
gies, as a function of the probability p a that a strategy will have a non-null action 
for a given global information state. Circles correspond to values taken over the 
last 1000 timesteps of a 2000 timestep run. The model parameters are N — 501, 
s = 2, T = 100, and r = 4, with binary (x = sgn) strategy rewards. 
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traded price p [t] . Orders whose execution depends on the traded price p [t] are 
known as 'limit orders' and are beyond the scope of the model types presented 
here. 

The presence of only market orders in our model brings us to the second 
important consideration: the number of buy orders in general does not balance 
the number of sell orders, i.e. in general D [t] ^ since all orders seek execution 
at time t irrespective of the price. There are two options then of how the market 
could function. The first option is to assume that the number of assets sold by 
the agents must equal the number bought by the agents. In this case the total 
number of assets traded is given by, 

fasscts [t] = min [n buy [*~] , n se u [t~]] 

where ntmy is the number of agents who seek to buy, and n sc \\ the number who 
seek to sell. With this system, the agents in the majority group (buyers or 
sellers) only get their order partially executed: for example if n^ uy = 40 and 
n so n = 50, the sellers can each only sell Assets [t] /n se ii = 40/50 = 80% of a 
quanta of assets. This is the system proposed in Ref. An alternative system 
assumes that there is a third party, a market-maker, who operates between 
buyers and sellers and makes sure that each and every order is fulfilled by 
taking a position in the asset herself. We then have that: 

^assets [t] = max [n buy [t~] , n soU [t~]] 

and that the market-maker's inventory is: 

4>M [t] = 4>m [t — l]—D [t~] (16) 

The market-maker, unlike the agents who just seek to maximize their wealth, has 
the joint goals of wealth maximization and staying market-neutral, i.e. as close 
to (f>M [t] = as possible. If the nature of the market is highly mean-reverting, 
the second of these conditions will be automatically true since Y^j=o D [t] ~ 0. 
In these circumstances, the market-maker should employ an arbitrage-free price 
formation process such as Equation || or Equation |3| If on the other hand the 
market is not mean-reverting, then the market-maker's position is unbounded. 
In these circumstances a market-maker should employ a price-formation process 
such as Equation [5] which encourages mean reversion through manipulation 
of the asset price based on X) j=o D W > as described in Section ^. A market- 
maker using a price formation such as Equation [5] can however be arbitraged 
by the agents. Whether the market-maker is actually arbitraged is then purely 
dependent on the underlying dynamics of the market. 

Let us demonstrate this by following Ref. [^) using a $G|l~3| model. For this 
model we have: 

D[t~] =A[t-l]-A[t-2] (17) 

as shown in Section ||. From Equation |l~6| we then get that 4>m [t] = X)*=o D [t] — 
<Pm [0] — A [t — 1] This immediately tells us that the market-maker's inventory 
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is bounded. The natural choice of price formation process to use would then be 
the arbitrage-free case of, for example, Equation ^. If we combine Equation |^ 



with Equation 17 



we arrive at: 



p[t]=p[0]e^ A (18) 



We can thus see how the $G|13| model produced the tightly bounded, jumpy 
price series seen in Figure [l]: the price at time t is no longer directly dependent 
on the price at time t — 1 and is bounded by p [0] e~~ <p [t] < p [0] . 

Given the behavior of the $G|l3| model, the use of the price formation process 
of Equation || seems rather unnecessary: the market maker does not need to 
manipulate the price in order to stay market-neutral on average. However, let us 
demonstrate the effect of using Equation || as in Ref. || . Combining Equations 
|and|l| with the market maker's inventory of <Pm [t] = — A [t — 1] we arrive at: 



P [t] = p [t - 1] exp 



(19) 



Unlike Equation [l^, Equation [l9] gives us again a multiplicative price process 
for finite values of Am- Although it seems an inappropriate step, we would 
thus expect that introducing Equation || as a price formation process would 
help regain some of the 'stylized features' of a financial market inherent from 
the multiplicative price process. However, closer consideration of Equation fL9| 
reveals something different. Let us consider the case of A [t — 2] = A [t — 1]. 
In this case the price change is simply an increasing function of A [t — 1] . If 
more agents are long rather than short at time t (A [t — 1] > 0), then the 
price movement AP [t, t — 1] is positive. The agents will then have benefitted 
from a positive increment in their notional wealth from simply holding their 
position. This manipulation of the (naive) market-maker leads to an unstable 
state whereby the agents arbitrage the market-maker simply by holding their 
long/short positions and watching the price rise/fall. This unstable state of 
the market will occur with increasing probability as the agents become more 
coordinated in their actions, i.e. as m is decreased. For high m, where there 
is no longer significant agent coordination and no crowding affects, the $GfL3| 
model with Equation |l^ for the price formation is able to avoid the unstable 
attractor and thus produce more 'market-like' time-series as demonstrated in 
Ref. [|. We can also examine the changing wealth of the market-maker by 
simply employing Equation p. We know that (f> M [t] — —A [t — 1], so Equation 
^| combined with Equation |l£ gives us: 

AW M [t + l,t] = cf> M [t] AP [t + 1, 1] 

= ~A[t]A[t-l]- (J^~\) Mt-lf (20) 

In the case of Am = A as in Ref. Q , we see can see - by taking a time average 
of Equation 20 - that whether the market-maker's wealth increases or decreases 
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with time will depend on the dynamics of the market. If the market is per- 
sistent as in the unstable case at low m, the market maker loses wealth since 
(A [t] A [t — 1]) > 0. Conversely if the market is anti-persistent, the market- 
maker will gain wealth since (A [t] A [t — 1]} < 0. 

At first sight, it therefore seems that the $G 13 model can generate a 'reason- 
ably market-like' state wherein the price is not unstable, it exhibits market-like 
dynamics and has a market-maker who is able to make a profit from her role. 
However, in order to generate this state it was necessary to use what seems to 
be an inappropriate market-making mechanism and also push the model into 
the uncrowded phase. Neither of these choices seem desirable in the quest for 
producing a realistic market model. 



10 Conclusion 

Following the introduction to the community of the Minority Game, there have 
been many proposed agent based market models, many of which exhibit the 
'stylized facts' of financial market timeseries in certain parameter ranges. It 
seems however that many of these market models include one or more basic 
assumptions which seem implausible when compared to the actions of real fi- 
nancial market agents. 

With the first 'wave' of market models it seemed remarkable that so many of 
the stylized features of real financial timeseries were reproduced so well. With 
more and more market model contenders entering the community, each with 
a quite diverse set of assumptions, it started to look as though it was easy 
to find models which would reproduce realistic market-like dynamics. It has 
been proposed jl5| that general systems which have activity based on a waiting 
time problem, will all enjoy the scaling properties seen in financial market data. 
Notwithstanding this observation, we believe that it is not at all obvious how one 
can produce a self-consistent market model which embodies, in a non- stochastic 
way, the behavior of realistic financial market agents, and yet can reproduce 
the 'stylized facts' of real financial markets. As we have shown in this work, 
adaptation of the MG payoff structure to incorporate a more realistic objective 
function results in a model increasingly dominated by trending as agents' re- 
sources are increased. As a result of this we are forced to look to a wider class 
of agent behavior and heterogeneity in order to recover the dynamical behavior 
we are looking for. 

This paper has provided a discussion of different aspects of market models 
and how they interact with each other. Instead of proposing a unique new mar- 
ket model of our own (thereby adding to the increasing family in the literature) , 
we have examined the different features one may want to include in a market 
model, and then discussed what their effect would be. We hope that the re- 
sulting discussion has managed to provide a basic of toolbox of model features, 
brought together by a common formalism. Perhaps some optimal combination 
of a subset of these model features will provide a new self-consistent, realistic 
yet minimal market model. However it is likely that one needs a diversity not 
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only of agent parameter values but also of agent behavior, if one is intent on 
forming the market model. 

Finally we emphasize that this paper falls into the category of an interim 
progress report on our research in this area to date - a synthesis of our work 
over the past few years and thoughts, particularly in light of recent proposals 
for modified agent-based models. We would therefore welcome any comments 
and, in particular, criticisms of our results and opinions. We also thank the 
many people with whom we have enjoyed discussions about financial modelling, 
both within and outside the Econophysics community. 
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